Multi-scaling mix and non-universality between population and facility density 
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The distribution of facilities is closely related to our social economic activities. Recent studies have 
reported a scaling relation between population and facility density with the exponent depending on 
the type of facility. In this paper, we show that generally this exponent is not universal for a specific 
type of facility. Instead by using Chinese data we find that it increases with Per Capital GDP. 
Thus our observed scaling law is actually a mixture of some multi-scaling relations. This result 
indicates that facilities may change their public or commercial attributes according to the outside 
environment. We argue that this phenomenon results from the unbalanced regional economic level 
and suggest a modification for previous model by introducing consuming capacity. The modified 
model reproduces most of our observed properties. 

PACS numbers: 89.75.Hc, 89. 75. Da, 89.40.Dd 



I. INTRODUCTION 

Facilities and infrastructures such as hospitals, schools, 
internet routers, origin from the development of human 
social civilization and in turn shape our modern daily 
life. This complex evolution process raises an interesting 
question that how these facilities are distributed and how 
they correlate with the whole social economic system. A 
better understanding on this issue could help to provide 
better public service and to save social opportunity cost. 
It is believed that population density and economic in- 
gredient are crucial on deciding the locations of these 
facilities But unevenly distributed population and 

economic level make the question very complicated. Al- 
though studies ranging from business economics, system 
engineering, computer science, geography to even biol- 
ogy have addressed on the issue, both theoretical basis 
and empirical demonstration are inadequate [IHUi • This 
causes the arbitrary assumption of uniformly distributed 
nodes in many spatial network models even though it is 
far from the reality [HI, [l3| • 

Intuitively, the number of facilities in an area increases 
with the corresponding population. The related studies 
can be traced back to a so-called p-median problem which 
aims to find the precise locations of facilities so that the 
mean distance that one reaches his nearest facility is min- 
imized [U, [l4[ . Numerical and analytic treatments have 
been used to suggest a relation D ~ p a where D is the 
facility density, p is the population density and a = 2/3 
[E EL 111 • This result accounts for the internet router 
distribution and the territorial division but does not con- 
sist with other empirical studies where although a univer- 
sal scaling D ~ p a is evidenced, the value of a, depending 
on types of faci lity, ranges from 2/3 to 1 rather than just 
fixed 2/3 0,II[l5f. More specifically, commercial facil- 
ity and public one has a. = 1 and a = 2/3, respectively, 
while for facility with both attributes, the exponent lies 
intermediate. These findings stimulated a recent study in 
which a general model based on the tradeoff between the 
profit (commercial concern) and the social opportunity 



cost (public service) was proposed [lj|. The value of a 
in that model represents the type of facility, which is de- 
scribed by the relative weight of commercial and public 
attributes. In other words it assumes a universal scaling 
exponent for a specific type of facility. (Actually all the 
previous studies tacitly approve this assumption.) Thus 
if we choose a part of these facilities according to some 
other properties, the scaling exponent is expected to be 
unchanged in this sample. 

As we will show in this paper, this is not always true. 
At least in Chinese cases, samples from a specific type of 
facility according to Per Capital GDP yield an increas- 
ing scaling exponent. Therefore the macroscopic power 
law relation between facility and population density is 
actually a mixture of different scaling functions. This 
multi-scaling property indicates a different picture that 
the attribute of a facility changes with the outside en- 
vironment and consequently affects its real distribution. 
In the next section we will provide empirical evidences 
for the above arguments. And in section III, we will try 
to offer an explanation and present a modified model to 
reproduce the multi-scaling property. 



II. EMPIRICAL STUDY 

We have gathered 7-year empirical data (2001 — 2007) 
including the positions of 4 typical types of facility (hos- 
pital, post office, school and theater), the population, 
GDP and the area of every county. Despite of the 
temporal fluctuations, there are totally more than 287 
counties and about 60500 hospitals, 56300 post offices, 
309200 schools and 5000 theaters. All these data come 
from the CHINA CITY STATISTICAL YEARBOOK 
whose electronic versions can be found and downloaded 
at 



http:/ /ishare.iask.sina.com.cn| These data allow us 



to calculate the population density, facility density, Per 
Capital GDP and their correlation at county level. Such 
coarse-grained treatment was also applied in the previ- 
ous study due to the resolution limitation of the facility 
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FIG. 1: (a) The scaling law between facility density (D) and pop- 
ulation density (p) for the cases of post office and theater in 2007. 
The data is logarithmic binned and is plotted on the log-log scale. 
The dotted lines are the corresponding fits, which are measured as 
D ~ p 7 (blue) and D ~ p0.98 (red), (b) The yearly scaling expo- 
nents for all the four facilities. Dotted lines are their corresponding 
averages calculated as 0.71,0.96,0.83,0.77 for post office, theater, 
school, hospital, respectively. 



positions [T|, . 

Our first observation consists with the previous stud- 
ies. Indeed a scaling relation between population and 
facility density emerges in all the four types of facility 
(Fig. HJa)). A detailed analysis on their scaling expo- 
nents indicates that despite of the yearly fluctuations, 
the exponents stay around their own averages which are 
measured as 0.71,0.96,0.83,0.77 for post office, theater, 
school, hospital respectively. According to Ref. (l5j, post 
office and theater whose scaling exponents are close to 
2/3 and 1 can be viewed as the representative of public 
and commercial facility. This is consistent with our ex- 
perience that post office disregarding profit is necessary 
everywhere while theater behaves oppositely. 

To examine the universality assumption of scaling ex- 
ponent, we classify the counties into four different classes. 
Specifically, we calculate the Per Capital GDP for each 
county and divide their logarithmic values into four equal 
intervals. Those counties whose Per Capital GDP lie in a 
common interval belong to the same class. Consequently 
counties in class 1 have the lowest Per Capital GDP while 
those in class 4 have the highest one. This classifica- 
tion distributes about 40 counties in class 1 and over 70 
ones in each of the other three. Although the number 
of counties in class 1 is almost half less, the number of 
the corresponding facilities is still larger enough to apply 
statistical analysis. Another problem is whether the Per 
Capital GDP correlates obviously with the population 
density so that our method might cause serious statis- 
tical bias. This possibility is basically ruled out as the 
correlation turns out to be very weak (correlation coeffi- 
cient < 0.25). 

For each type of facility, we study their scaling rela- 
tion in different classes. As illustrated in Fig. [2J in each 
class the relation between population and facility den- 
sity is still a power law. But their scaling exponents 
are not equal but increase clearly with the Per Capital 
GDP level (class number). This result contrasts with the 
universality assumption of the scaling exponent in Ref. 
fl5| . Instead it indicates that the observed scaling rela- 
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FIG. 2: Multi-scaling relation for the case of hospital in 2007. 
The class from 1 to 4 describes the increase of the Per Capital 
GDP level. For each class the data is logarithmic binned and is 
plotted on the log-log scale. Dotted lines are their corresponding 
fit plotted here for guiding eyes. The data for different classes is 
undrawn a little for better visualization. For each class D and p 
still follows power law but the exponent increases clearly with the 
class number. The result indicates a multi-scaling picture rather 
than universality of the scaling exponent. 



tion D ~ p a is actually composed of some multi-scaling 
behaviors. If we accept the physical meaning of a in- 
terpreted in Ref. (l5| . the multi-scaling phenomenon re- 
veals an interesting fact that facilities can change their at- 
tributes according to the outside economic environment. 
Particularly, they tend to be commercial in high Per Cap- 
ital GDP level area but still provide necessary public ser- 
vices in poor developed places. Further detailed analysis 
suggests that this multi-scaling property as well as the 
positive correlation between the multi-scaling exponents 
and Per Capital GDP level occurs every year in all the 
types of facility regardless of their temporal fluctuations 
(Fig.©. For school and hospital (Fig.^a) and Fig.^b)), 
the exponents vary from 2/3 to 1, which almost covers 
the whole possible range. In contrast, the exponents of 
post office are much more stable. As shown in Fig. Efc) 
they stay near 0.7 and rise up to no more than 0.85. Sim- 
ilarly in Fig. [3jd) , the range of the exponents of theater 
narrows around 0.9. It seems that purely commercial or 
public facilities are not likely to behave diverse attributes, 
which is somewhat consistent with our intuition. 



III. EXPLANATION AND MODEL 
MODIFICATION 

All these findings are not captured by the previous 
models and thus require a more complete theory. As 
will be presented later, a small modification by introduc- 
ing the consuming capacity can reproduce most of these 
properties. Before that, we will first review the model of 
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FIG. 3: The relation between the multi-scaling exponents (y-axis) 
and the Per Capital GDP level (class number or classification as 
labeled on x-axis) for four types of facility: (a) school, (b) hospital, 
(c) postoffice, (d) theater. For all the four types of the facility the 
exponents increase with the Per Capital GDP every year without 
exceptions. For school and hospital, the exponents vary in a large 
range from 0.6 to 1. In contrast, the exponents of post office and 
theater range narrowly. 



Ref. [15| and clarify some useful concepts. 

Suppose we are given the total population N p and the 
total number of facilities Nf on a plane. And suppose 
people always visit their nearest facility, by which we 
can define the Voronoi cell Vi , whose area is s 2 ; , as the set 
of points closer to the i-th facility than to any others Q. 
This means that the number of visitors to the i-th facility 
is the number of people living in Vi, which is denoted as 
n j. Therefore the population density in Vi is calculated 
as pi = rii/si while the corresponding facility density Di 
is Di = I/si. 

For commercial facility, Ref. assumes that the 
profit of the i-th facility is proportional to n,. Therefore, 
a facility having lower m is better off moved to other 
location with higher population for higher profit. This 
strategy is applied by all the facility during the relocat- 
ing process. Finally the system will must reach an equi- 
librium so that every facility has almost the same profit, 
i.e. n, ~ Np/Nf. Then by using the expression pi and 
Di calculated above, we arrive at D ~ p. On the other 
hand, public facility concerns prior the social opportunity 
cost caused by the distance between visitors and facilities, 
which is described by Ti^r-i) with (r,) ~ ^fsl representing 
the average distance to the i-th facility. To provide better 
public service, facilities at lower-cost places should be re- 
located to those with higher TJj(rj). Then in steady state, 
rii{ri) becomes the same for all the facilities. Again by 
using the expression pi and Di, we have D ~ p 2 / 3 . For 
facilities with both attributes, Ref. [lj| defines a general 



(1) 



where f3 is tunable within the range [0,1]. Analogous 
to the above analysis, one can derive the final scaling 
relation D ~ p 2 /(P+ 2 \ which gives a = 2/{fi + 2). 

However in this nicely compact model, the assump- 
tion that profit equals to population is too simple. After 
all, the profit is yet closely related to the commodity 
prices and people's consuming capacity. Facilities pro- 
viding luxury commodity are not opened at poor places 
even though they are densely populated because no one 
there can bear such high level consumption. On the 
other hand, facilities in the regions with lower population 
but higher consuming capacity can still benefit from high 
prices. Therefore facilities with both attributes, due to 
their ingredient of commercial part, take chance to locate 
and gain at developed area even though the service there 
provided by their public ingredient is adequate enough. 
This causes the number of facility in a well developed 
area is larger than those in poor places even if they are 
equally populated, which indicates a more rapid increase 
of facility in high economic-level places. If we assume 
the consuming capacity has a positive correlation with 
Per Capital GDP, which seems plausible, the above ex- 
planation for multi-scaling does make sense. 

To make the explanation more convincing, it is useful 
to introduce an alternative expression of Eq.(l) to char- 
acterize the transition from public to commercial facility, 
expressed as 



An; + (1 - A)n,(r,}, 



(2) 



where A € [0, 1] is a tunable parameter controlling the 
relative weight of commercial or public attribute and con- 
sequently determining the final exponent a, just as the 
role of j3 in Eq.(l). Eq.(2) has the same physical mean- 
ing and similar effect to Eq.(l) but turns out to be more 
difficult to apply analytical treatment. However, one can 
still prove its scaling property by the method used in 
Ref. fig. To introduce the consuming capacity to the 
model, we denote rrii as the average expense consumed 
by every person living in Voronoi cell Vi so that the profit 
equals to mini. Then Eq.(2) is rewritten as 



Xrrnni + (1 - A)ni(rj). 



(3) 



The only modification compared to Eq.(2) lies in the 
first term of the righthand of Eq.(3), i.e. A — > Am;. 
This modification does not affect the macroscopic scal- 
ing law between D and p qualitatively, but changes the 
relative weight of commercial attributes on microscopic 
level. Particularly, for large rrii the first term XmiUi is 
enhanced by the effective weight Ami, which causes the 
system to concern more about profit. This leads the fa- 
cility to be more commercial and the scaling exponent 
to be close to 1. On the other hand if m* is small, the 
second term (A— l)n,(r,) takes over, then the system con- 
cerns more about social opportunity cost just like public 
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FIG. 4: Multi-scaling relation between Z) and p simulated by the 
modified model with fj, = 1. The data is plotted on the log-log 
scale. The class from 1 to 4 describes the increase of the level 
of m. The relation in each class is power law but the exponent 
increases with the class number just as in Fig. f2] Dotted lines 
are their corresponding fits plotted here for guiding eyes. Inset: 
The simulated multi-scaling exponent vs class number. For p = 
(red) and fi — > oo(purple), which represent the pure commercial 
and public facility, the exponents stable at 1 and 2/3, respectively. 
For intermediate /i, such as p = 1 (blue), the exponent increases 
with the classification. The result is averaged over 50 simulations. 

facility and the exponent becomes close to 2/3. There- 
fore even for the same A (i.e. the same type of facility), 
different m, leads to different system behavior. This is 
the reason why multi-scaling emerges and why their ex- 
ponents increase with the economic level. Moreover, if 
A = (purely public facility), Eq.(3) becomes in- 
dependent. So the system degenerates to the classical 
model in Ref.[ll| and thus displays only a single scaling 
relation with the exponent a = 2/3. On the other hand, 
if A = 1 (purely commercial facility), Eq.(3) depends to- 
tally on the first term. But to.; in this case has no effect 
on the exponent but only changes the coefficient of the 
scaling relation, leaving the only exponent a — 1 which 
is also independent of TOi[I3]. Therefore multi-scaling 
property can be less pronounced in a very commercial 
or public facility, which explains the narrow range of the 
multi-scaling exponents observed in post office and the- 
ater. 

If we still adopt the idea in Ref.[l5[ and follow the 
expression of Eq.(l), the above explanation indicates 
that the exponent /3 in Eq.(l) should be modified to 
be a function of parameter rrii, i.e. c, = rii(ri)P( mi \ 
Then the multi-scaling exponent can be calculated as 
2/(f3(m,i) + 2) and the macroscopic exponent a can be 
given by taking an average over all possible mi . To repro- 
duce the multi-scaling property by simulation, we apply 
the following simple formula of f3(m) 

/3(m) = 1 - {m/m max y, (4) 



FIG. 5: The relation between the scaling exponent a and the pa- 
rameter p. The green points are the simulation result which is 
averaged over 50 realizations. The red dotted lines are the analyti- 
cal prediction. Inset: The scaling relation between population and 
facility density simulated by our modified model with /j, = 1. The 
simulation data is logarithmic binned and is plotted on the log-log 
scale. Red dotted line is the fit measured as D ~ p 8 . 



where m max represents the possible maximum of to and 
fi is a parameter controlling the sensitivity of /3 with to. 
If /i = 0, the facility becomes purely commercial while 
if fx — > oo, it becomes purely public. So fi also controls 
the exponent a. The real-world (3(m) can be quite dif- 
ferent (probably related to the type of facility) . Finding- 
its precise expression could be a complicated task and is 
beyond the scope of this paper. The aim of our simula- 
tion is only to reproduce the multi-scaling phenomenon 
qualitatively. Our simulation follows the similar process 
to that of Ref.[l5| except for the above-proposed mod- 
ification. Specifically, we first distribute the population 
density p and the economic level m randomly on a plane. 
And then we further put some facilities of a certain type. 
At each time step, every facility calculates its benefit ac- 
cording to Ci — ni{ri)P( mi \ Facility with the lowest Cj 
then moves to the region with the highest benefit. This 
process repeats until the system is steady. At steady 
state, we measure various quantities and their relations 
just as done for empirical data. Note that the classifi- 
cation here is carried out according to the value of m. 
Other details of the simulation such as parameter setting 
are described in APPENDIX A. In Fig. [H we plot the 
simulated relation between population and facility den- 
sity for different classes (i.e. different level of m or say 
different interval of to). Clearly it displays multi-scaling 
property with the exponents increasing with class num- 
ber. The inset of Fig. [4] presents the results of the sim- 
ulated multi-scaling exponents. For purely commercial 
(/j = 0) or purely public (^ —¥ oo) facility, the expo- 
nents become stable at 1 or 2/3. But for intermediate 
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ft = 1, the exponents increase. In the inset of Fig. [5] we 
demonstrate that the whole scaling relation between D 
and p maintains in our modified model. And the scaling 
exponent a in the simulation decreases with the param- 
eter fi, as shown in Fig. O Note that all these results 
can be calculated analytically. We present an analytical 
solution of a{fi) in Fig. [5J which is in good agreement 
with the simulation. More details about the analytical 
calculations are reported in APPENDIX B. 



IV. CONCLUSION 

We have analyzed the scaling relation between popu- 
lation and Chinese facility density at different Per Cap- 
ital GDP levels. Our study does not see the universal- 
ity of the scaling exponent but instead suggests a multi- 
scaling picture. More interestingly, such multi-scaling ex- 
ponents increase with the Per Capital GDP regardless of 
the type of facility. These results indicate that facilities 
can change their commercial or public attributes accord- 
ing to the outside environment, i.e. they take chance 
to gain more profit in developed area but still fulfil their 
public-service responsibility in poor region. We have also 
provided possible explanation and suggest a modification 
by considering consuming capacity. The modified model 
can reproduce most of our observed properties. 

Our study stresses on existence rather than universal- 
ity. Indeed the multi-scaling property is observed every 
year for all the four types of facility. Therefore their oc- 
currence is certain rather than coincidental. On the other 
hand it is also appealing to explore whether this phe- 
nomenon occurs in other more developed country where 
the economic system is more balanced and stable. Due 
to the limitation of data, the present study cannot cover 
this area. If this point is evidenced, it indicates the multi- 
scaling could be a common property Otherwise it either 
requires explanations from sociocultural, economic, pol- 
itics, etc or leads us to a long-time dynamic evolution 
picture of facility allocation, both of which are signif- 
icant on understanding our social economic system or 
even providing guidelines to urban development. 

This work was partially supported by the National 
Nature Science Foundation of China under Grant Nos. 
11075057, 11035009 and 10979074. 



that the facility number changes with time during the 
simulation while p u and m u are always fixed as soon 
as they are distributed. Since the area of all u is 1, 
p u and D u is exactly the population density and facil- 
ity density respectively. The number of people visit- 
ing facility i in place it, denoted as rij U , is calculated as 
n%u = Pu/D u while the average distance for these people 
to travel to facility i, denoted as (ri u ), is calculated as 
(fiu) ~ \/~Siu = 1 / fl8l] ■ Then we can determine the 
benefit c iu = n iu {r m )^ m ^ ~ p u / D u +I3{niu)/2 for every 
facility i in the unit square u. Note that this quantity 
only depends on place u and is equal for any i within 
this unit square, so we can replace Ci U by the denotation 
c u . At each time step of our simulation, we calculate 
the c„ for each places according to the current D u (t). 
Then we eliminate a facility in the place with the low- 
est c u and create one in the unit square with the highest 
c u . This procedure repeats until the system reaches its 
steady state, at which the relation between D and p as 
well as the exponents is stable. 

We test this coarse-grained method by repeating the 
simulation in Ref. [l5| . i.e. setting /3(m) = constant 
as Eq.(l). We find the method behaves the same as 
the simulation in Ref. [l5j and reproduces all their re- 
sults. In our own simulation, we have 200 different unit 
squares. And we set p(p) = l/p with p <E [0,5000] and 
p(m) — 1/300 with m e [0,300]. The initial distribution 
of facility is also uniform with D(t = 0) € [0, 1000]. Wc 
choose power-law distribution of p because i). it coincides 
with the real population distribution which is observed 
to be heavy-tailed; ii). the power-law formula leads to a 
uniformly distributed data points on log- log plot, which 
gives a better visualization. Note that other distribu- 
tions do not change the simulation results qualitatively. 
By these parameters, we obtain (3(m) = 1 — (m/300) M . 

When the simulation reaches its steady state, we mea- 
sure various quantities and their relations just as done 
for empirical data. Note that the classification here is 
carried out according to the value of m. Specifically, we 
divide the value of m (not the logarithmic value of m) 
into four equal intervals. Those unit square u whose m u 
lies in a common interval belongs to the same class. Al- 
though there is a bit difference from what we have done 
for empirical data, it does not change our conclusion at 
all. 



Appendix A: simulation details 

We use a coarse-grained simulation for our modified 
model. We first divide a plane into many unit squares 
whose area all equal to 1 and assume that all the sit- 
uations in one unit square are approximately identical. 
Then we distribute for each unit square u the popu- 
lation p u and the consuming capacity m u according to 
the corresponding distribution p{p) and p(m). And we 
further distribute randomly the initial number of facil- 
ity D u (t = 0). We use denotation D u (t) to emphasize 



Appendix B: analytical calculations 

Since a = din(p) — - = (aj) TOi , the macroscopic expo- 
nent is exactly an average of cu over all possible mi, i.e. 
a = J m max p(m)a(m)dm, where a(m) = 2/(/3(m) + 2). 
Substituting the corresponding parameters and equation, 
we have 

f 1 2 

a(u) = / dx. (Bl) 
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This function gives a good agreement with 
the simulation, as plotted in Fig. [5] Partic- 
ularly, for fi = 0,0.2,0.5,1,2,5,10, Eq.(Bl) 
gives a = 1,0.926,0.866,0.81,0.76,0.712,0.692, 
which is consistent with the simulation data 
a = 1,0.91,0.85,0.8,0.755,0.71,0.68 in Fig. H 
We can also calculate the multi-scaling expo- 
nent for each class. The calculation is given by 
a(C) = f™ max p(m\C)a(m)dm, where p(m\C) is the 



conditional probability density under class C and equals 
to 1/75 in our simulation. m rnin and m max here is 
related to the specific class C. We calculate the multi- 
scaling exponents for \i = 1 and have the theoretical 
result 0.69, 0.76, 0.84, 0.94 for class 1, 2, 3, 4, which is also 
consistent with the simulation data 0.68,0.74,0.81,0.91 
in the inset of Fig. 2J 
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